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ABSTRACT 

We present a set of mock redshift catalogs, constructed from N-body simulations, designed to mimic 
the DEEP2 survey. Galaxies with a range of luminosities are placed within virialized halos in the 
simulation using a variant of the halo model. The halo model parameters are chosen to reproduce local 
clustering and abundance data and assumed to be independent of redshift. This allows us to predict the 
luminosity function, two-point correlation function, luminosity- and scale-dependent bias, redshift space 
distortions etc. of our galaxies at higher rcdshifts. We show that the low order clustering properties are 
consistent with preliminary DEEP2 data. The catalogs can be used to evaluate the selection effects of 
the survey and to test new algorithms and statistics that are to be used in the analysis of DEEP2 data. 

Subject headings: Galaxies:high-redshift — Cosmology: theory 



1. INTRODUCTION 

As our theoretical understanding of structure formation 
has advanced and the questions we ask of modern redshift 
surveys have sharpened, the use of realistic mock catalogs 
has become widespread. Almost all modern galaxy red- 
shift surveys have made extensive use of mock catalogs 
both in their design and analysis. 

Mock catalogs which reproduce the low order clustering 
statistics and galaxy redshift distributions can serve to 
test algorithms for biases and quantify the impact of nu- 
merous observational selection effects. They are an indis- 
pensable tool in interpreting a wide range of observational 
data. A suitably constructed mock catalog can also allow 
us to connect the observations directly to modern models 
of structure formation, which are based exclusively on hi- 
erarchical build-up of dark matter halos in an inflationary 
cold dark matter universe. 

In this paper we describe in some detail the construc- 
tion of mock catalogs currently being used by the DEEP2 
redshift survey (Davis et al. |20Q2| . These mock catalogs 
are created by populating large, high-resolution dark mat- 
ter simulations of the ACDM cosmology with galaxies in 
such a way as to reproduce the DEEP2 observations as 
closely as possible. The mock catalogs have undergone 
a continual evolution over the last few years, and we in- 
tend to continue to refine these mock catalogs as further 
observations become available. In this paper we describe 
the current status of the programme and the mocks which 
are being used in analysis of current DEEP2 data. We 
describe some of the properties of the galaxies in the sim- 
ulations which will be useful in interpreting future DEEP2 
observations and finish by describing directions for future 
work. 

The outline of the paper is as follows. In S]5|we place 
our work in the context of previous attempts to make mock 
catalogs. The N-body simulations and halo model are de- 
scribed in ^ We describe the intrinsic properties of our 
mock galaxies, derived from single time outputs of the sim- 
ulation in 2] and how we turn these 'galaxy boxes' into 
lightcone outputs in After presenting some basic re- 
sults in f|Slwe describe the limitations of the current gen- 



eration of mock catalogs and some directions for future 
development in J7| We conclude in ^S] 

2. MOCK CATALOGS 

Mock catalogs have been produced in a variety of ways 
over the years. Early work on mock catalogs from N- 
body simulations (e.g. van Haarlem, Frenk & White 1997'; 
Cole et al. nMHl Yoshida et al. lMTTl Hamana et al. .2002,) 
worked with relatively low resolution simulations, which 
were unable to cover a large volume while resolving the 
halos hosting galaxies of interest. For this reason they 
made simple prescriptions for tagging individual dark mat- 
ter particles within the simulation as 'galaxies'. The prob- 
ability of a DM particle becoming a galaxy was adjusted 
to ensure the right mean number density of galaxies and 
was typically a function of the local density, smoothed on 
relatively large (Mpc) scales. 

Jing, Mo & Borner H1998|l anticipated a halo based ap- 
proach, using what are by modern standards very small 
simulations. The particles in their simulation had masses 
comparable to galaxy halos and when computing corre- 
lation functions they gave such 'galaxies' a weight which 
was a decreasing function of halo mass. This enabled them 
to reproduce the clustering statistics of the LCRS sample 
(Shectman et al. 11996(1 . An approach along similar lines, 
but more akin to the procedure we adopt, was pioneered 
by Kauffmann, Nusser & Steinmetz (11997() , who populated 
dark matter halos in a higher resolution N-body simulation 
with 'galaxies'. Again individual dark matter particles 
were tagged as galaxies, but the particles were now halo 
members and the number of halos was taken from semi- 
analytic recipes for galaxy formation. The key assumption 
of this approach was that galaxies form in virialized dark 
matter halos, and that the properties of galaxies within 
these halos was a function mostly (or entirely) of the halo 
mass. 

Mock catalogs based on a full-blown semi-analytic model 
of galaxy formation were first presented in Diaferio et 
al. H1999|l . using the simulation described in Kauffmann 
et al. I|1999p . Similar work has been presented by Benson 
et al. ifMTUjl . Somerville & Primack ((TOU^ and Hatton et 
al. H2003p . Numerous mock catalogs have been produced 
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from these highly detailed models. Of particular relevance 
to the present paper is the work of Coil et al. H^UUlfl . who 
used semi-analytic models to create mock catalogs for the 
DEEP2 survey. The simulations presented in this paper 
make a number of technical improvements over these ear- 
Her catalogs. 

While the semi-analytic models have many advantages, 
it is difhcult to adjust them to match the known properties 
of any particular observational sample, which is sometimes 
useful when trying to test algorithms. What we desire is a 
model which situates galaxies within their correct cosmo- 
logical context, but allows us to modify their properties 
within reasonable bounds to match the data more closely. 

The halo model allows us to make progress in this di- 
rection. As recognized by Peacock & Smith H2Ul)U|l . the 
key ingredient in the semi-analytic modeling is the halo 
occupation distribution (HOD), which describes the num- 
ber of galaxies of a certain type in a halo as a function of 
the host halo mass. These authors made mock catalogs 
with HODs which were fitted to observations rather than 
produced by semi-analytic models. However they did not 
attempt to make catalogs tuned to any particular survey. 
Bullock, Wechsler & Somerville ( 2002 ) used the low order 
statistics of Lyman Break Galaxies (LBGs; see references 
in Bullock et al.) to constrain a HOD for LBGs, but again 
did not build any mock catalogs. Zhao, Jing & Borner 
(|2)02) built mock catalogs to study the pairwise veloc- 
ity dispersions of LBGs, using a HOD formalism. Similar 
work was presented by Scoccimarro and Sheth ((2002 ) who 
used a HOD fit to the clustering seen in the PSCz (Hamil- 
ton & Tegmark VMM Szapudi et al. EnD» Feldman et 
al. 2001) survey to populate halos. Their halo distribution 
was not based on N-body simulation but produced using 
an algorithm (PT-HALOS) developed from perturbation 
theory. Mock catalogs based on high resolution N-body 
simulations, which resolve the halos which host galaxies 
in the relevant range of luminosities, and tuned to match 
both 2MASS (White & Ko chane k I^UU^ Kochanek et al. 
nUU^ and 2dF (Yang et al. Yan, Madgwick & White 

2003) have been produced in the last few years. This pa- 
per presents a new set for the high-z galaxies probed by 
DEEP2. 

3. SIMULATIONS AND GALAXY BOXES 
3.1. N-body simulations 

The basis for the mock catalogs is a series of high resolu- 
tion N-body simulations of structure formation in a ACDM 
universe which were run using the TreePM code (White 
I2002|l . Each of the simulations has 512'^ dark matter par- 
ticles in a cubical box with periodic boundary conditions. 
The large particle number allows a wide dynamic range 
in mass and length scale to be simulated. The box sizes 
and other information can be found in Table ^ Unless 
otherwise stated we will draw our examples from Model 4. 

We use outputs from z ~ 1.5 to z ~ 0.6 for DEEP2 
fields with a photo-z cut. For each output we produce a 
halo catalog by running a "friends-of-friends" group finder 
(e.g. Davis et al. I1985|l with a linking length 6 = 0.15 (in 
units of the mean inter-particle spacing). This procedure 
partitions the particles into equivalence classes, by linking 
together all particle pairs separated by less than a dis- 
tance b. We keep all halos with more than 8 particles, 



and consider each of these halos as a candidate for host- 
ing 'galaxies'. The halo mass is estimated as the sum of 
the masses of the particles in the FoF halo, times a small 
correction factor (typically 10%) which provides the best 
fit to the Sheth- Tormen mass function (Sheth & Tormen 
I1999|l . We additionally compute a number of statistics (ve- 
locity dispersion, virial mass etc) based on the dark matter 
distribution in each halo. This information is propagated 
through the remaining steps in building the mock observa- 
tions, so that the halo properties and galaxy memberships 
can be provided to the user if necessary. 

3.2. Galaxy populating using HOD and CLF 

For each output of the simulation we populate halos with 
'galaxies' by choosing certain simulation particles to be 
galaxies and assigning them luminosities. The number of 
galaxies more luminous than some L^nt hosted by any halo 
is determined by the halo occupation distribution (HOD) 
which we compute using the conditional luminosity func- 
tion formulation of Yang et al. ( 2003 ) . The actual number 
of galaxies is drawn from a distribution for each halo in 
the simulation which is Poisson for large galaxy numbers 
but drops below Poisson for smaller numbers. For more 
details, see Yan et al. H2003|l . 

Once the number of galaxies in each halo is known, they 
are assigned luminosities using the conditional luminosity 
function (CLF) <i>(L|M) which describes the luminosity 
function (LF) of galaxies in halos of mass M . This func- 
tion is constrained from observations of the LF and clus- 
tering statistics of the survey in question (Yang et al. 120031 
van den Bosch et al. 1^01751 Yan, Madgwick & White l^nUSjl . 
Within the constraints set by current DEEP2 data there 
are a range of cosmologies and CLFs which would be ac- 
ceptable. We further constrain the parameter space by 
choosing models which fit the 2dF LF at z ~ (Madg- 
wick et al. 12002(1 and both the published 2dF and lower-z 
DEEP2 correlation functions (Madgwick et al. I2003al and 
Coil et al. 2003, respectively). The method for generat- 
ing the models is described in detail in Yan, Madgwick & 
White (2003) . For each N-body simulation listed in Table 
n except the first, we list in Table [3 one CLF model which 
fits the low-z data well. For simulation 1, with a high 
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1 


300 


20 


1.7 


1.0 


0.70 


1.00 


2 


300 


20 


1.7 


0.8 


0.70 


1.00 


3 


128 


9 


0.1 


0.9 


0.70 


0.95 


4 


256 


18 


1.0 


0.9 


0.70 


0.95 


5 


192 


13 


0.4 


0.8 


0.67 


1.00 



Table 1 

The parameters of the simulations which are being 

USED TO make mock CATALOGS. AlL SIMULATIONS ARE OF 
THE ACDM FAMILY WITH fimat = 0.3 = 1 — f^A. THE 
SIMULATIONS WERE STARTED BETWEEN 2: = 50 — 100 AND 
RUN TO THE PRESENT WITH OUTPUT TIMES SPACED EVERY 

0(100 /i"^Mpc) FROM 2 ~ 2. The force softening was of 

THE SPLINE FORM, HELD CONSTANT IN COMOVING 
COORDINATES, AND A "PlUMMER EQUIVALENT" SMOOTHING 
IS QUOTED IN THE 2ND COLUMN. 
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No. 


2 


3&4 


5 


ai5 


-0.995 


-1.194 


-1.274 


V 


-0.198 


-0.280 


-0.269 


Ml 


2.22e+ll 


1.46e+ll 


9.38e-l-10 


M2 


7.46e+ll 


1.06e+12 


1.21e+12 


7i 


2.649 


1.874 


3.612 


72 


0.496 


0.366 


0.453 


73 


0.656 


0.673 


0.622 


{M/L)o 


93.25 


120.1 


102.0 


Ms 


7.83e+12 


1.04e+13 


3.86e+12 



Table 2 

The CLF model parameters for each model we use. 

For the meanings of these parameters, see Yan, 
Madgwick & White (I2U03II and references therein. 
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Fig. 1. — The halo occupation distribution we use. The three hnes, 
from top to bottom, are N{M) for galaxies brighter than O.IL,, 
0.3L« and l.OL*, respectively. The little fan shows several different 
slopes for comparison. 



matter clustering amplitude and a relatively low mass res- 
olution, we were unable to find a model which fits the data 
well and which did not have a large fraction of the galaxies 
in halos below our mass resolution. To avoid multiplica- 
tion of figures and tables we choose to show results for only 
simulation 4 for the rest of this paper. The main results 
are unchanged for the other models. 

The HODs of model 4 for three different luminosity cuts 
are shown in FigureQand the model parameters are listed 
in Table [3 AH of the models we have chosen look qual- 
itatively similar to the one shown in Figure there is a 
sharp cut-off at low mass below which the formation of 
'bright' galaxies is inefficient. There is a shelf or shoulder, 
which is required to match the number counts of galaxies, 
before an approximately power-law rise to high mass. For 
the models which match DEEP2 and 2dF the fits all pre- 
fer a (-/Vgai) which rises more slowly than the halo mass 
itself, indicating a rising mass-to-light ratio. This is not 
unexpected in samples selected in the rest-frame blue or 
ultra-violet (there is some evidence that if -band selection 
provides a steeper high end slope (Kochanek et al. 12003(1 1. 
That the DEEP2 galaxies are relatively underrepresented 
in massive halos will have consequences for cluster finding, 
velocity fields and the determination of cosmological pa- 
rameters from redshift space distortions as we shall discuss 
later. 

Given the total number of galaxies brighter than Lent 
in each halo it is necessary to choose luminosities for them 
based on $(L|M). Just as it was necessary to specify both 
(N) and the higher moments above, it is necessary to know 
the fluctuations about the mean ^{L\M) at this stage. If 
the luminosities of galaxies in each halo are drawn inde- 
pendently from $(L|M) then one occasionally finds rel- 
atively low mass halos with two 'bright' galaxies. Since 
such systems have small radii this in turns implies an in- 
crease in the 'bright' galaxy correlation function at small 
scales. Such an increase can indeed be seen in some of the 
semi-analytic models, but appears to be absent in data. 
This suggests that some mechanism acts to suppress pairs 
of bright galaxies in small halos. We can model this in a 
number of ways. On one extreme we could calculate the 
luminosities for all galaxies in halos of similar masses by 
drawing from ^{L\M) and then distribute them, round- 
robin, in halos in order of decreasing luminosity. This 
ensures that all the bright galaxies are partitioned among 
the halos rather than having pairs end up in any one halo. 
A slightly different approach, which has very similar clus- 
tering properties, was suggested by Yang et al. H2003|l . 
Here we compute Li such that a halo of mass M has (on 
average) only 1 galaxy brighter than Li. We then draw 
luminosities for the galaxies in this halo, allowing only the 
brightest galaxy to have L > Li. This also suppresses the 
higher moments of ^{L\M) for bright galaxies. We shall 
follow Yang et al. ( 2003j) unless stated otherwise. 

Once the luminosities are given, the most luminous 
galaxy is assigned to the center of mass of the halo, and 
the other galaxies are assigned to random particles within 
the halo. While the code allows the possibility of a radial 
or velocity bias in assigning galaxies to particles, through- 
out we assumed that galaxies traced the mass and velocity 
distribution of the halo (inheriting its shape and any sub- 
structure) . 
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4. RESULTS FROM THE GALAXY BOX 

In this section, we present some results computed from 
these simulated galaxy boxes. These are not directly rele- 
vant to observations, in that they do not include evolution, 
survey geometry, slitmask making, target selection and so 
forth. However they allow us to study the intrinsic prop- 
erties of the mock galaxies to set a baseline for what mock 
observations should later recover. 

We present in Table|31some basic properties of all of our 
models. Each row of the table is discussed in correspond- 
ing sections below. 

4.1. Luminosity Function 

We first compare the luminosity function and correlation 
function with the data used to constrain the model. 

Figure[21shows that the simulation reproduces quite well 
the 2dF LF at redshift z = 0.1 (Madgwick et al. I^UH^ . 
which we used as an input to the model generation pro- 
cess. There is a slight excess of faint galaxies in the boxes, 
which can be traced to an excess of low-mass halos over the 
Sheth-Tormen prediction. These excess groups are likely 
chance associations which are being classified as groups by 
the FoF procedure at the lowest masses. However the ex- 
cess has a minimal effect on the light-cone outputs we will 
generate. Due to the increasing volume and Lcut with red- 
shift only a small fraction of galaxies have L < 0.2L*(z) 
in the survey. 

The LF at z = 1 has fewer relatively bright galaxies than 
at z = since there are fewer massive halos. At first sight 
this appears contrary to the results from the COMBO-17 
survey (Wolf et al. 2003J. However we have plotted the 
luminosity in units of L*, which is a function of redshift. 
As we will show later, a pure L*(z) evolution produces 
a LF evolution consistent with that seen by COMBO-17. 
These LFs are relatively well fit by a Schechter function, 
but the evolution of the Schechter parameters is difficult 
to quantify because of the large covariance between them. 

4.2. Correlation Function and Bias 



No. 
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3&4 


5 


b{L,) 


1.46 


1.33 


1.45 




0.22 


0.19 


0.22 




5.31 


5.43 


4.41 




53 


52 


83 


7 


0.89 


0.80 


0.61 



Table 3 

Some useful properties of the different models, all 
evaluated at z = 1. the mean bias at l* and the 
slope of the bias with luminosity are evaluated 

analytically using the clf formulation of the halo 

MODEL. The (COMOVING) CORRELATION LENGTH, ro, FOR 
L, GALAXIES IS COMPUTED FROM THE Z = 1 OUTPUT OF THE 
N-BODY SIMULATIONS AND QUOTED IN MPC. THE LAST 

TWO ROWS CORRESPOND TO THE PARAMETERS IN EQ. EI A 

FIT TO THE GALAXY VELOCITY DISPERSION VS. HALO 
RICHNESS. The numbers quoted here are FOR GALAXIES 

brighter than 0.3l* and the velocities are in 
(physical) KMS"""^. 




0.1 l.G 10.0' 

L/L* 

Fig. 2. — The luminosity functions computed directly from the 
galaxy boxes at z ~ 0.1 (solid line) and z ~ 1.0 (dashed line). The 
LF at z ~ 0.1 is compared with that observed by 2dF (Madgwick et 
al. i2002il (points). Notice that the x-axis is L/L,(z), which scales 
out the pure luminosity evolu tion . For the LF evolution plotted in 
fixed magnitude bins, see Fig. 1131 
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0.1 1 ^ ^ ^ ^ , , ,1: 

0.1 1.0 10.0 

r (Mpc/h) 

Fig. 3. — The correlation functions for all galaxies brighter than 
O.IL*, 0.3L, and 1.0L» in the box at redshift z = 0.85, compared 
with the DEEP2 measurements. Here L« is Lt,{z = 0.85). The sim- 
ulation ^(r) is computed from the periodic box, knowing the mean 
density of galaxies precisely, so there are no edge effects or integral 
constraint issues. The difference among the three lines shows the 
luminosity dependence of the bias. 
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In FigureOlwe show the correlation function of all galax- 
ies brighter than 0.1 L^{z) in the box at z — 0.8. This is 
compared to the data published in Coil et al. H2()()8|l and 
again we find good agreement ^ . The observed ^ (r) is de- 
rived from the projected two point correlation function, 
Wp(rp), assuming ^(r) is a power law. This is quite a good 
approximation for our galaxies on the scales of interest, 
and we will show later f ti5.3|l that our Wp{rp) in the mock 
lightcones is in good agreement with the data as expected 
from this comparison. 

The bias of the galaxies in our models is both lumi- 
nosity and scale dependent and stochastic, as seems to 
be the case for real galaxies (Norberg et al. I2(JU1I Nor- 
berg et al. 2002, Pea cock 1 997; Ha milton fc Tegmark l^UU^ 
Tegmark & Br omleylTM^ Blanton lWini Gray et al. 12002 
Hoekstra et al. l2002f) . In Figure^we show the scale depen- 
dent bias for all galaxies brighter than 0.1L^{z), defined 
as 

for several different redshifts. In selecting the models we 
emphasized those in which the luminosity dependence of 
the bias on large scales followed closely the measurements 
made in the 2dF survey (Norberg et al. l2001|l who find 



0.85- 



0.15 ( ^ 



(2) 



While the results are noisy due to the finite volume of the 
simulation box, the power spectrum measurements bear 

^This data comes from an inversion of Wp(rp), so does not corre- 
spond to a particular luminosity cut. It is approximately that of a 
sample of luminosity brighter than O.IL* for L, at 2 = 0.8 as shown 
by Fig. El 



out this expectation, though at smaller scales there are 
obviously departures from this scaling. 

Using the variance in counts of cubical cells we find 
at z = 1 that the cross correlation, r, between DM and 
galaxies is above 95% on scales larger than 8h~^Mpc but 
drops to < 60% by 0.5/i~^Mpc indicating that the bias 
is stochastic. There are insufficient galaxies in the box 
to probe much smaller scales but we expect significant 
stochasticity in the relation between mass and galaxy den- 
sity on those scales. 

At redshift z = 1, the bias is larger than at z = in 
models with a fixed HOD for all redshifts since halos of a 
given mass are rarer and more biased at a higher redshift. 
The luminosity-dependence of the bias is similar to that 
in the local universe but with a steeper slope. In Table |3| 
we show the large-scale linear bias, b{L^,), and the linear 
coefficient of the luminosity-bias relation around L*(z), 
b'{L^,), for all of our models at z = 1. Also shown is the 
correlation length, rg, for L > galaxies in each model 
at z = 1. 

The amplitude of galaxy clustering is non-monotonic 
with redshift, as expected. Figure |S1 shows that for galax- 
ies brighter than 0.1L*(z) the comoving correlation length, 
ro(z), has a minimum near z ~ 1.2 in these models. Above 
this redshift the increasing galaxy bias dominates over the 
decreasing mass clustering leading to an increase in the 
observed clustering strength. Below this redshift the evo- 
lution of the bias is more modest than that of the mass 
clustering. 

4.3. Redshift Space Distortion 

Of course the observations are done in redshift, rather 
than real, space so it is of interest to examine the redshift 
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Fig. 4. — The scale dependent bias for several different redshifts. 
The bias is defined through the ratio of correlation functions. 
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Fig. 5. — The redshift evolution of the (comoving) correlation 
length, ro, for galaxies brighter than 0.lLt{z). The observed clus- 
tering power has non-trivial evolution due to the interplay of bias 
and mass clustering evolution. 
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space distortions in the simulations. We first show the 
distortions in the correlation function and then look at 
individual halos. 

To compute the redshift space correlation function we 
make the assumption that the line-of-sight is parallel to 
the z-axis of the simulation box and define the redshift 
space position as s — z + u where z is the redshift con- 
verted to comoving distance along the line of sight and u is 
the velocity converted to a line-of-sight distance difference 
across which the Hubble flow is equal to the peculiar veloc- 
ity. Figure^shows ^(r^, n) computed from the galaxy box 
for galaxies brighter than 0.1L*(z) a,t z — 1.0. We see the 
expected "butterfly" structure, with large-scale enhance- 
ment of the clustering by super-cluster in-fall fKaiser ll987(l 
and small-scale suppression of clustering by the virial mo- 
tions within halos (the finger-of-god effect). Transverse 
to the line-of-sight the clustering length is enhanced from 
r ^ 4/i^^Mpc to 5.5/i~^Mpc. Along the line-of-sight the 
virial motions within halos are still large enough to in- 
crease the clustering length to 8/i~^Mpc. 

We investigated evolution of ^(rp, tt), but found that the 
contours show very little change over the redshift range 
0.5 < z < 1.5. This might be in contradiction with DEEP2 
observations (Coil et al. I2003|l which show little flnger-of- 
god effect in the higher-z sample (0.9 < z < 1.35). The 
source of this disagreement is currently unclear. As we 
show in Figure \7\ in the currently popular cosmologies 
the halo population is not evolving dramatically across 
the redshift range probed by DEEP2. The reduction in 
the finger-of-god effect must thus be due to the way in 
which galaxies are populating the dark matter halos or 
to an observational selection effect. We have assumed a 
constant CLF throughout, which becomes less likely as we 
probe closer to the epoch at which the majority of these 
galaxies are forming. It may be that the higher redshift 
galaxies are increasingly to be found outside of massive 
halos. It is also possible that at higher redshift DEEP2 
is beginning to pick up a different population of galaxies, 
which is significantly underrepresented in massive halos. 
On the observational side the more distant galaxies need to 
be brighter to make it into the sample and their host halos 
subtend a smaller angle on the sky. Both the increased 
luminosity cut and the constraints imposed during DEEP2 
slitmask making could reduce the finger-of-god effect from 
pairs of galaxies lying within the same halo. We intend to 
investigate this, and several other issues related to redshift 
space distortions, further in a future paper. 

We can also look at the effect of virial motions on small- 
scale clustering by focusing on halo velocity dispersions. 
The dark matter halos in all of the simulations obey a 
reasonably tight M — a relation which follows the virial 
scafing: 

where M200C is the mass inside a sphere interior to which 
the mean density is 200 x the critical density, E{z) = 
H(z)/Hq is the dimensionless Hubble parameter and cr200 
is the one dimensional velocity dispersion of the dark mat- 
ter particles within the same radius. The scatter around 
this relation is near 5% in a for < z < 2. The mean 
richness of halos of mass M can be inferred from Figure 




Fp (Mpc h"') 

Fig. 6. — The 2-D correlation function in the galaxy box in real 
space (dashed line) and redshift space (solid line) at ^ = 1.0. The 
thickest lines indicate C()"p,7r) = 1. 




Fig. 7. — The cumulative (comoving) number density of halos 
with velocity dispersion larger than (T20O1 computed from the N- 
body simulation, as a function of redshift. 
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n taking care to correct M200C to the Migoh used in the 
Sheth-Tormen mass function (Migoh = M54C ~ I.4M200C 
if Jlniat = 0.3; see e.g. White We can also com- 

pute the (ID) galaxy velocity dispersion using the known 
halo membership of galaxies in the box. Even though we 
have taken the satellite galaxies to have no velocity bias in 
these simulations, the galaxy velocity dispersion is system- 
atically lower than the halo velocity dispersion because we 
assume that the brightest galaxy in the halo sits at the 
halo center and inherits the halo center of mass velocity. 
This is shown in Figure |S1 where we plot the dispersion 
computed from galaxies brighter than 0.3L*(z), as appro- 
priate for galaxies at z ~ 1 (see FigureEl, against the halo 
velocity dispersion. 

It is also interesting from the point of view of finding 
groups in the DEEP2 data to consider the cumulative (co- 
moving) number density of groups as a function of velocity 
dispersion. This is shown in Figure [TUl at fixed redshift, 
again using the known halo membership of the galaxies 
and only considering galaxies brighter than 0.3L*(z) at 
z = 1.0. 

While there is a large scatter, the richness of the groups 
is correlated with the galaxy velocity dispersion. We list in 
Table|2|the best fitting parameters to a power-law relation 
of the form 

agai = <yoNl, (4) 

for galaxies brighter than 0.3L*(z) at z = 1 in each model 
we use. 

5. SURVEY GEOMETRY REALIZATIONS AND MOCK 
OBSERVATION 

The DEEP 2 redshift survey is very nearly a pencil-beam 
survey. In particular the 1-hour-survey, on which we shall 
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focus here, will cover 4 fields each 120' x 30'. For our 
assumed cosmology this translates into ~ 20 x 80 h^^Mpc 
at z = f. The survey depth extends to z ~ f.5 and a 
photo-z cut will be applied on 3 of the 4 fields to select 
only galaxies with z > 0.7. For the remaining field no cut 
will be applied. Thus for the fields with a photo-z cut 
the mock catalog should be a light-cone ^ f 300 /i^^Mpc 
(comoving) long. For the field without a photo-z cut, this 
would be ~ 3000/i^iMpc. 

Our simulation volume is 8 times as large as the survey 
light cone without a photo-z cut, and even larger in the 
case of the photo-z cut. Thus suggests we can make ap- 
proximately 8 independent mock catalogs per model from 
the sequence of outputs we have available. Additionally 
the galaxies in earlier outputs are made independently of 
those in later outputs so the effects of repeated structure 
are mitigated to some extent by the galaxy population 
process. 

To mimic the DEEP2 geometry we stack our galaxy 
boxes to produce light-cone outputs. The field of view 
is oriented so that the line-of-sight is offset by a few de- 
grees from one axis of the box. In this way we can trace 
through the structure in a continuous manner, using the 
periodicity of the box, without retracing the same struc- 
ture every box crossing. We use earlier and earlier outputs 
as we trace back along the line-of-sight. In principle this 
can lead to discontinuities as the structures in the box 
'jump' between outputs. The time sampling is sufficiently 
dense however that this is not an issue^. 

Given the distances and luminosities of the galaxies in 
the light-cone we obtain magnitudes through the defining 
relations 

Afs^. =Af4.(z)-2.51og|- (5) 

mR = Mb, ~5 + 5logio D + K{z) (6) 

where M is the absolute magnitude, m is the apparent 
magnitude, D is the luminosity distance (in pc) and K{z) 
is the iiT-correction. Since we used the Bj-band 2dF LF to 
tune the CLF to assign luminosities, and DEEP2 is select- 
ing targets in R-band, we need a X-correction which trans- 
fers rest-frame Bj band magnitude into observed-frame R- 
band magnitude. Our K{z) is defined as 

K{z)^Riz)-B,iO) (7) 

and we use a quadratic fit over the scattered if-correction 
points of galaxies in DEEP2 early data from Willmer et 
al. H2004|l to get an averaged JC-correction over spectral 
types: 

K{z) = -2.85 + bMz- 2 A9z^ (8) 

valid for 0.6 < z < 1.4. As we will discuss later, the 
y-intercept of the if -correction does not influence the red- 
shift distribution of galaxies in the mock catalog, since we 
will normalize it to the observed number counts of galaxies 
in the photometric catalog. 

We write as a function of z because it is observed 
that galaxies at high rcdshift (z 1) are intrinsically 

^By design light travels 0(100 /i~^Mpc) between output 
times. Thus a halo moving with velocity 300km/s moves only 
0(100 /i~^kpc) between outputs. We have tested that this is small 
enough for our purposes 



brighter than local galaxies. We postulate a linear evo- 
lution 

M4(z) = M4(z = 0)-ife^ . (9) 

and use the 2dF value M^.iz = 0.1) - Slog^o h = -19.79 
(Madgwick et al. 2002) to compute the zero point. By 
studying the fundamental plane in the rest-frame B band, 
d\og{M / L) B / dz has been determined to be around 0.4 — 
0.7 (van Dokkum & Stanford 2&^ Rusin et al. lMTl Treu 
et al. 2002, and references therein). These numbers cor- 
respond to a Ke between 1.0 and 1.75, if galaxies in 
the past have the same mass as those of today. In order 
to fit the DEEP2 number density and COMBO-17 LF we 
choose Ke = 1, and assume the linear relation of Eq. |51 

The evolution of is an important assumption that 
we are putting into the mock catalogs. It strongly affects 
the final z-distribution of galaxies in catalog and we will 
discuss it further in W>.\\ It also determines which range of 
intrinsic galaxy luminosities dominate the DEEP2 sample. 
Because of the shape of Lcut/-^* shown in Figure |^ the 
sample tends to be dominated by galaxies with L ~ Q.bL^. 

Finally, DEEP2 uses a photo-z cut designed to select 
only galaxies at z > 0.7. This cut is not a sharp cut in 
redshift space and the final z distribution turns out to have 
a peak at z ^ 0.8 (see Figure [TT|) and a foreground tail that 
extends to quite low redshift. To mimic the effects of this 
cut in our lightcones we applied a probabilistic foreground 
cut-off to galaxies with z < 0.8. 

6. BASIC RESULTS FROM MOCK CATALOGS 

By stacking the galaxy boxes in different directions and 
viewing the simulations from different positions we pro- 
duced 12 almost independent mock catalogs each of which 
has the size of a DEEP2 field (or 36 almost independent 
pointings) . 

Since the DEEP2 survey is so narrow, with a transverse 
dimension not much larger than the non-linear scale, we 
expect sample variance to be a significant issue which we 
can quantify with the mock catalogs. The simplest statis- 
tic is the total counts: the number of galaxies brighter 
than the magnitude limit of the survey in each field be- 
tween redshift 0.7 and 1.35. The fractional variance in 
this number is 5.7%. For each pointing, which is a third 
of a field, the fractional variance is 8.0%. We turn now to 
consider less coarse measures. 

6.1. The cone diagram and the z-distribution 

Figure 1121 shows a cone diagram from one of the mock 
catalogs generated from model 4 of Table ^ Because the 
DEEP 2 geometry is almost a pencil beam, we have bro- 
ken the redshift direction up into 3 pieces which we have 
plotted next to each other. All of the galaxies at each 
redshift have been projected into the slice as a function 
of their transverse coordinate. The large scale structure 
is visually apparent in the slices, with groups and clusters 
separated by voids. It appears from this figure as if most 
galaxies lie in large structures but actually ~ 80% galaxies 
in the cone are isolated (i.e. only one galaxy in their par- 
ent halo passes the DEEP2 selection). Of the dark matter 
halos with at least 1 galaxy passing the DEEP2 selection, 
^ 90% host only 1 galaxy passing the cut. 

Figure ITTI compares the z distribution in the mock cat- 
alogs with that in the DEEP2 data. The observed z dis- 
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tribution is obtained from three separate fields covering a 
total of 0.72 square degrees, which is equivalent to the area 
of two 'pointings'. The mocks have similar z distribution 
to the real data. The total number of galaxies differs be- 
tween the mock pointing and the real data due to various 
observational effects including the difference in sky area, 
the effects of slitmask making and the a non-zero redshift 
failure rate. Also the DEEP2 selection is not complete 
beyond z = f .35 due to the way slitlets are positioned on 
the mask, making it difficult to compare to the data above 
z = f .35. 

As we mentioned earlier, the evolution of i» and the 
ii'-correction are important factors affecting the z distri- 
bution of galaxies. We adjust the zero point of these two 
factors to match the number counts of galaxies^ brighter 
than the magnitude limit of the survey. Since at present 
we quote only the luminosity in a single band, this cor- 
rection shows up in a combination of the zero points of 
the A'-correction and local measurement. (In practice, 
this requires that we increase the apparent magnitude of 
each galaxy by 0.1 to better match the number density. A 
better treatment of if-corrections requires a knowledge of 
galaxy types in the mock catalogue, which we have not im- 
plemented yet.) However, the shape of the if-correction 
and the coefficient of the evolution, K^, are crucial 
for the final z distribution. Figure El shows the minimum 
luminosity as a function of z. Notice that at z ~ 0.7, 
the minimum luminosity of DEEP2 spectroscopic targets 
is around O.IL*, which means for the three fields with a 
photo-z cut our mock catalogs probe sufficiently far down 
the LF. For the field without a photo-z cut we need to 
make use of a smaller box, with higher mass resolution, to 
extend the light-cones to smaller redshift. 

6.2. Luminosity Function 

Fig El compares the LF of the mock with that mea- 
sured from COMBO-17 in rest frame Bj band for all SED 
types. We measured the LF for galaxies with redshift be- 
tween 1.0 and 1.2 in each mock using 1/V^nax method, as 
was done for COMBO-17. As is well known, the 1/T4nax 
method is very sensitive to fluctuations in number den- 
sity due to large scale structure in the sample. We no- 
ticed this when comparing the results from 36 different 
mock catalogs, some of which were a factor of two off the 
mean which is plotted in Figure E| Since the field of 
view of COMBO-17 is comparable to a DEEP2 pointing, 
we regard the difference between the COMBO-17 LF and 
the mocks as not statistically significant. A preliminary 
LF from DEE P2 is basically consistent with COMBO-17 
(Willmer et al. 12004(1 . suggesting our mock catalogs are an 
adequate approximation to the DEEP2 data. 

6.3. Projected Correlation Function 

In addition to the correlation function shown in Figure 
121 we computed the projected correlation function Wp{rp) 
from the mock catalogs. This allows us to compare di- 
rectly with the measurements (Coil et al. l2UU3|l from early 
DEEP2 data. The projected correlation function is almost 
independent of redshift-space distortions, since it only uses 

^When matching the number counts, we assume that among the 
galaxies with z > 0.7, 15% are at redshifts z > 1.4, beyond the range 
of DEEP2. 
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the transverse spacing between objects. It is defined as 

/•OO 

Wp{rp)^2 ^[rp,n)diT . (10) 

JO 

In order to make a direct comparison with the Coil et 
al. H2()():^|l data we computed Wp{rp) as described in that 
paper. We also divided the mock sample (0.7 < z < 1.35) 
near its median redshift to make two subsamples: lower- 2; 
(0.7 < z < 0.9) and higher-z (0.9 < z < 1.35), performing 
the Wp(rp) calculation on each of them. First ^{rp, tt) was 
measured for \tt\ < 20/i^^Mpc then summed over the tt 
direction to give Wp{rp). 

Figure n~T] shows the results of Wp{rp) for both subsam- 
ples. The solid lines show the best fit power-law to the 
DEEP2 data from Coil et al. H2003|l . while the dashed lines 
show the fits for the ±la range of the amplitude holding 
the slope fixed. We cannot include the errors on both the 
amplitude and slope because they are highly correlated, 
and the correlation is not quoted in Coil et al. (2003). 
The error on the slope is smaller than in the amplitude 
however, so this range should be a good indication of the 
current uncertainty. The mock points are spread vertically 
because the sample variance is very large from field to field. 
This is a reflection of the narrowness of one DEEP2 point- 
ing. For the lower-z sample, the DEEP2 measurements go 
almost through the densest region of the points, showing 
beautiful agreement between the mocks and the data for 
this statistic. We caution that we did not run our mock 
catalogs through the DEEP2 slitmask-making procedure. 
Coil et al. 112003') claim this will only slightly weaken the 
correlation strength, and a detailed comparison is beyond 
the scope of this paper. 



10 




10 1 1 1 1 .J 

-23 -22 -21 -20 -19 -18 
M 

Fig. 13. — The mean luminosity function of the mock catalogs 
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in the redshift bin 1.0 < z < 1.2. Also plotted is the comparison 
between local 2dF Bj band LF and that from the simulation box, 
showing the evolution with redshift. 



For the higher-z sample, the mocks show slightly stronger 
clustering than the data. For Vp > 1 h^^Mpc the data 
points are covered within the range of the sample variance 
of the mocks, but below lft,~^Mpc they are tantalizingly 
different. This discrepancy will be mitigated a little by 
the complex effects of mask-making, which we have not 
included, but likely not enough to bring the two into agree- 
ment. Further there might be a type-dependent selection 
bias as we move to higher redshift, which also needs to be 
considered. However this result might be telling us that 
the HOD in this redshift range is different from the locally 
calibrated HOD that we have used. This would be very 
exciting if confirmed, as it indicates that further DEEP2 
data can begin to constrain the evolution of the HOD di- 
rectly through measurements of clustering at high-z. 

The correlation function in our mock catalogs is approx- 
imately a power-law. Making a power law assumption we 
can fit Wp{rp) in each pointing to obtain a distribution 
of spectra indices and amplitudes for £,{r). We find that 
the clustering strength, ro, varies strongly from pointing 
to pointing, covering the range 3/i~^Mpc to 4.5 /i~^Mpc. 
The spectral index also varies, and the index and length 
are clearly correlated. In the very near future DEEP 2 will 
be able to obtain clustering measurements over a much 
wider area of sky than we have at present, which should 
reduce the sample variance. According to our mock cat- 
alogs, if correlation functions arc measured over DEEP 2 
fields instead of pointings, the range of vq will shrink by 
one third. 

7. LIMITATIONS AND FUTURE DEVELOPMENT 

We hope these mock will be a valuable aid in evaluat- 
ing the various selection effects of the survey, testing new 
algorithm and statistics which will be used to analyze real 
data. But there are several limitations which must be 
borne in mind when using these mocks. 

(i) The mocks still present an idealized version of obser- 
vations. Real observations have a lot of inevitable 
difficulties which include systematic errors in pho- 
tometry, bright star holes in the field, CCD defects, 
incompleteness in the survey caused by weather, fail- 
ure to get redshifts for all targeted galaxies, etc. 
Some of these difficulties are uncorrectable even with 
the help of mock catalog, and this complicates the 
comparison between mock and reality. 

(ii) We have assumed that our satellite galaxies trace the 
velocity field of the dark matter, so no velocity bias 
(in this sense) is present in the mock. This should 
be borne in mind when comparing galaxy and dark 
matter velocity dispersions or the finger of god effect. 

(iii) The halo model and CLE algorithm are a simpli- 
fication of the complex physics involved in galaxy 
formation. We have chosen a single specific model 
for each cosmology to match observational results. 

To be useful the mock catalogs do not need to be a per- 
fect replica of the DEEP2 data, they simply have to be a 
close enough approximation that they provide a valid test 
of algorithms and methods. The required level of agree- 
ment may need to be assessed on a case by case basis. We 
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view these catalogs as part of a continuing chain of devel- 
opment, as more information is obtained by DEEP2 the 
mocks can be further tuned to increase their fidelity. 

We intend to evolve these catalogs as more observational 
information becomes available. In addition to refinements 
to the CLF parameters and cosmology we are investigating 
several extensions to the information available per galaxy. 
First we can include color information in the catalogs. We 
are implementing a scheme for including colors based on lo- 
cal density and luminosity information, with the model for 
the color distribution drawn from a principle component 
analysis of the data. Second we are investigating the inclu- 
sion of type information similar to that obtained from the 
spectral classification scheme of Madgwick et al. ( 2003Bjl . 
These extensions will be tested against the DEEP2 data 
as it becomes available. 

8. CONCLUSIONS 

We have presented mock catalogs based on high resolu- 
tion N-body simulations which are designed to match the 
low order clustering statistics of the DEEP2 redshift sur- 
vey. The mock catalogs are based upon the halo model for 
clustering and situate galaxies in their correct cosmologi- 
cal context. The simulations used are sufficiently large to 
give many independent realizations for deep pencil-beam 
surveys and have sufficiently dense time sampling to model 
possible evolution information embedded in the survey. 

The mock catalogs provide positions, velocities and lu- 
minosities for galaxies in multiple fields, for a number of 
cosmologies within the ACDM class. The galaxies have 
a non-trivial scale and luminosity dependent bias, which 
is stochastic, but trace the large scale structure showing 
voids, filaments, sheets, walls and clusters. In each case 
the z distribution, luminosity function, correlation func- 
tion and projected correlation function match well the pre- 
liminary results from the DEEP2 survey. 

In the process of building these mock catalogs we have 
noticed several interesting results. 

(i) The halo model and its extensions (such as the CLF) 
provide a powerful tool for building mock catalogs 
for galaxy redshift surveys. Galaxies are put into 
the simulations in a physically motivated way. This 
has two advantages. First, the HOD and CLF can be 
constrained from observations so that it almost guar- 
antees the mocks will mimic reality. Second, tun- 
ing of the model parameters teaches us about galaxy 
evolution. The HOD and its extensions provide an 
ideal meeting ground between theories of cosmology 
and galaxy formation. 

(ii) To get the correct z distribution of galaxies is not 
trivial, indicating that the observed z distribution 
contains a lot of information. Within our formalism, 
the z distribution involves a iiT-correction, evolu- 
tion, halo mass function evolution and CLF evolu- 
tion. The first two factors are measurable, the third 
one depends on cosmology, and the last one on the 
details of galaxy formation. If the if-correction and 

can be well measured, the remaining degen- 
eracy between cosmology and CLF evolution can be 
broken by other measurements, e.g., the fluctuations 
in the z histogram in a narrow beam survey, the LF, 
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counts of groups and their richness, the correlation 
function etc. 

(iii) Our simple assumption of a HOD which does not 
evolve from 2: = to z > 1 provides a reasonable 
fit to the early DEEP2 data. This is a surprising 
result. Additionally there may be hints that a non- 
evolving HOD will fail to fit improved data at higher 
redshift, which could provide valuable information 
on the formation and evolution of galaxies. 

We anticipate that these mock catalogs will be crucial 
in understanding the data coming from the DEEP2 survey 
and will help us to understand the formation and evolution 
of galaxies. 
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